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Abstract 

Background: Verbal autopsy (VA) is a widely used method for analyzing cause of death in absence of vital 
registration systems. We adapted the InterVA method to extrapolate causes of death for stillbirths and neonatal 
deaths from verbal autopsy questionnaires, using data from Malawi, Zimbabwe, and Nepal. 

Methods: We obtained 734 stillbirth and neonatal VAs from recent community studies in rural areas: 169 from 
Malawi, 385 from Nepal, and 180 from Zimbabwe. Initial refinement of the InterVA model was based on 100 
physician-reviewed VAs from Malawi. InterVA indicators and matrix probabilities for cause of death were reviewed 
for clinical and epidemiological coherence by a pediatrician-researcher and an epidemiologist involved in the 
development of InterVA. The modified InterVA model was evaluated by comparing population-level cause-specific 
mortality fractions and individual agreement from two methods of interpretation (physician review and InterVA) for 
a further 69 VAs from Malawi, 385 from Nepal, and 180 from Zimbabwe. 

Results: Case-by-case agreement between InterVA and reviewing physician diagnoses for 69 cases from Malawi, 
180 cases from Zimbabwe, and 385 cases from Nepal were 83% (kappa 0.76 (0.75 - 0.80)), 71% (kappa 0.41(0.32- 
0.51)), and 74% (kappa 0.63 (0.60-0.63)), respectively. The proportion of stillbirths identified as fresh or macerated by 
the different methods of VA interpretation was similar in all three settings. Comparing across countries, the 
modified InterVA method found that proportions of preterm births and deaths due to infection were higher in 
Zimbabwe (44%) than in Malawi (28%) or Nepal (20%). 

Conclusion: The modified InterVA method provides plausible results for stillbirths and newborn deaths, broadly 
comparable to physician review but with the advantage of internal consistency. The method allows standardized 
cross-country comparisons and eliminates the inconsistencies of physician review in such comparisons. 



Background 

Cause-specific mortality data on childhood deaths are 
vital to identify health needs, compare patterns of death 
across populations, plan and monitor interventions, and 
inform policy [1-3]. In high-income countries, all births 
and deaths are enumerated through vital registration 
systems, and death certification is routine. In low- 
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income settings, most births and deaths occur at home, 
death certificates are rarely available, and vital registra- 
tions are often inadequate or nonexistent [2-4]. 

Verbal autopsies (VAs) provide an alternative means 
of identifying probable causes of death through inter- 
views with a close caregiver of the deceased, in which 
information about the circumstances, signs, and symp- 
toms leading to death are gathered. VAs have limita- 
tions: they require recollection of events at the time of 
death, rely on understanding and reporting of signs and 
symptoms by interviewees, and may be influenced by 
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interviewer skills. The data must also be interpreted to 
establish a diagnosis [5]. Conventionally, VA question- 
naires are read by two or more physicians separately 
and one or more causes of death are attributed. A cause 
of death is established when physicians' opinions corre- 
spond; otherwise diagnosis is reconsidered and discussed 
with or without the input of an additional physician. If 
no agreement is reached, the cause of death is consid- 
ered undetermined. Repeatability of this diagnostic pro- 
cess over time and in different settings is problematic, 
particularly when diagnostic criteria are not standar- 
dized amongst different clinicians [6-8]. In some situa- 
tions, disagreement between physicians is such that a 
large proportion of causes of death remain indetermi- 
nate [7,9]. Moreover, the method is costly, time-con- 
suming, and requires the involvement of physicians who 
are an already overstretched resource in low-income 
countries [6,10]. 

Despite these limitations, VAs are useful in estimating 
cause-specific mortality fractions (CSMFs) in population 
studies [6,8,11]. They have been used extensively in epi- 
demiological studies, household surveys, and sentinel 
surveillance sites, and have been piloted in subsamples 
from sample registration systems. There remains a need 
to refine the technique to make it more comparable, 
repeatable, easy to apply, and cost-effective. 

VA questionnaires devised by the WHO attempt to 
standardize the interview process, but more standar- 
dized approaches to interpreting VA data are needed. 
Hierarchical algorithms and computer programs based 
on logistic regression have been used, but they are diffi- 
cult to standardize across cultures and age groups and 
can usually only identify single causes of death [12,13]. 
InterVA uses a probabilistic method and has been tested 
in a range of settings for deaths at all ages, across sexes, 
and for maternal deaths [14,15]. 

We describe the refinement and evaluation of InterVA 
to identify causes of death in the perinatal (stillbirths 
and neonatal deaths in the first seven days) and neonatal 
periods, using data from three different settings: Malawi, 
Zimbabwe, and Nepal. 

Methods 

Based on Bayes' theorem [16], the InterVA model calcu- 
lates the probability of a set of causes of death given the 
presence of circumstances, signs, and symptoms (collec- 
tively called 'indicators') reported in VA interviews. The 
method is described in detail elsewhere [10,17]. Briefly, 
a finite number of causes of death are assigned to a pre- 
defined matrix of estimated probabilities of occurrence. 
The presence of indicators (Table 1) modifies the prede- 
fined probabilities of each cause of death upward or 
downward using Bayes' theorem according to the for- 
mula 



rv ' p(I|C)*p(C)+p(I|!C)*p(!C) 

where p (C|I) indicates the probability of a cause of 
death (C) given the presence of the indicator (I) and p 
(I/!C) is the probability of I in the absence of C [10]. 

Probabilities of final-cause categories increase or 
decrease in relation to specific signs and symptoms 
reported in the VA interview. If symptoms are not 
reported, the probabilities do not change. The program 
is available online http://www.InterVA.net. Users can 
enter the data as single cases or in batches, and the 
model generates up to three causes of death and their 
respective likelihoods. Prior to the current study, the 
probability matrix consisted of 34 cause of death classifi- 
cations and 104 indicators [10]. 

Data sources 

To explore the performance of InterVA in different set- 
tings, 734 stillbirth and neonatal VAs were obtained 
from rural areas of three low-income countries. 

In Malawi (Mchinji District), 169 stillbirth and neona- 
tal VAs were collected from 2004 to 2005, as part of a 
cluster-randomized study evaluating two community 
interventions to improve maternal and child health [18]. 
Although designed for the study, the VA questionnaire 
was comparable in structure and content with the sub- 
sequent WHO questionnaire [19]. Completed question- 
naires were interpreted independently by two Malawian 
pediatricians, who assigned up to three causes of death 
on the basis of a hierarchical classification and algorithm 
[20] . They were able to use alternative diagnoses where 
necessary. Discrepancies were resolved by discussion 
and, if consensus could not be reached, the cause of 
death was recorded as indeterminate. 

In Nepal (Makwanpur district), 385 VAs were collected 
from 2001 to 2003 as part of a cluster-randomized study 
of a community intervention to improve maternal and 
child health [21]. The questionnaire was again compar- 
able with the subsequent WHO tool. Questionnaires 
were interpreted independently by two Nepalese pediatri- 
cians, who each assigned a single cause of death on the 
basis of the same algorithm used in Malawi. Discrepan- 
cies were resolved after review by a third physician. 

The third data source included 180 neonatal deaths 
from Zimbabwe, identified as part of a maternal and 
perinatal mortality study conducted in 2007 and 2008 
[22]. Neonatal VAs were conducted using the WHO 
tool. Questionnaires were interpreted independently by 
two physicians, who each assigned a single cause of 
death using the International Classification of Diseases 
and Related Health Problems (ICD-10). Discrepancies 
were resolved after review by a third physician (Table 2). 
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Table 1 InterVA indicators and cause of death categories 



Indicators Cause of Death Categories 



wac thic; an o npr fa^-i-l- vparc 
vvdp u il j oil ciuci ujt ycdip 


anv rhrnnir/rpr irrpnt Hiarr (A.-1-\aA 

d 1 ly UI 1 1 UI 1 IL/ 1 CLU 1 1 CI 1 L U Id 1 1 pTVV j 


Pari nata ac nh\/via 
rcinidLdi apuiiyAia 


wss this sn 3ci u It 50-64 yesrs 


3ny BbdominB swelling 


Congenitsl mslformstion 


wss this s femsle 15-49 yesrs 


sny vomiting 


Premsturi ty 


iA/3c tnic a ma p 1 ^-AQ \/parc 
vvdp LI il j u 1 1 lent: lu ycdip 


anv \/p nu/npec /iai inHir~P 
di ly ycriiuvvi icrpp/jdui iuiuc 


T"pta n i i c 
1 CLOI 1 Up 


\Anc tnic a rhi n ^-14 \ rc 
Wdp LMIp d LMIIU j I *+ ycdip 


an\/ annnrma r\v i trino 
diiy dUIIUIIIIdlliy ul Ulllltr 


Pnoi im/~\nia 
r 1 IcUl 1 1UI lid 


\Anc 1~ n i C a rhi n 1 -^d. VP a 

Vvdp LMIp d LMIIU 1 H ytrdli 


an\/ i innan/ rpfpntinn 
diiy Ul II Idly IfcrLfcrllLIUII 


t\A a 1 aria 
IVld Idl Id 


wss this 3n infsnt 4 wks-1 yr 


3ny h3em3turi3 


yn-ir |qc 

iviedsies 


W3s this 3 neonste < 4 wks 


sny swelling of 3nkles/legs 


fv/ldninniTic 

iviei iiny iui 


wac chip nrpnnant at Hpath 

VvGj P 1 IC U 1 Cy I I a 1 1 I d L UCd LI 1 


nn hilatpral ^wpllinn nf anklp 

1 IU UlldLCTIdl jyVCIIM ly Ul dl ir\icr 


P)ia rrhpa 
l/ id 1 1 1 ica 


HiH nrpnnanr\/ pnrl within n wppkc 
u iu u i cy i i a i i uy c i iu vviuiiii u v v cc ro 


anv ^kin lp^inn^/i ilrpr^ 
diiy pi\iii icjiui i j/ uilci j 


RlnnHv Hiarrhpa 
uiuuuy uiaiiiica 


HiW Tina i riQcc act" at" pact ^ va /pp kc 
UIU 1 1 1 Id I 1 1 1 1 Icpp IdPl dL IcdpL D VVCCIxp 


an\/ racn ( nrin-mp^ c qci 
diiy [dill IJIUIIlI ItrdilcpJ 


WLI1CI dLULc llllcLLIUII 


did fins illness Isst < 3 weeks 


sny herpes zoster 


Mslnutrition 


wss desth very sudden/unexpected 


sny messles rssh 


Kwsshiorkor 


W3s desth during wet sesson 


sny excessive night sweBts 


HIV/AIDS relsted 


\A/ac Hpath Hi irinn Hrv cpac,nn 

V Vd P U Cd L 1 1 i '.. ! 1 1 1 '.. 1 U 1 y jCGjUI 1 


anv pvrpt;t;i\/p watpr intakp 

dl ly tTAUCTPPlVC: VVGLtTI II 1 Ld IXCT 


Pi ilmnnarv ti ihprn ilocic 
r uii i iui iaiy lliuciuliiupip 


ifl/ac c/np in a tranennrt arrinpnt 
vvdp p/ I ic M I a li d I ipuui L duuiuci I L 


an\/ pvrpec i\/p i irinatinn 
d i ly CALciJi vc uinidLiuii 


r~hrnni/" inTP^tinn 

\^.IIIUIIIU IINCULIUII 


HiH c/hp Hrn\A/n 
uiu p/ I I c u I u v v I I 


an\/ pvrpec \\/p< fr\r\r\ intakp 
di ly calcjjivc iuuu ii iLdi\t: 


K/latprna rai kpc 
1 Via LCI 1 Id 1 UdUPCP 


nan c/hp fa pn rprpnt tw 
i idu p/ 1 ic idiicii i cuci i Liy 


an\/ an itp fpv/pr 
al ly qlulc level 


Af~i ito rpc rii ratnrv/ Hicpacp fnrit nnpm ^ 

AAL.U LC 1 CpUI 1 d LUI y UlpCdpC ^IIUL UIICIII./ 


an\/ nnienninn nitp ctinn 
di ly uuipui hi ly, uilc, plii ly 


an\/ i~iPrc ictpnt fp\/pr 0 VA/k^ 
ally uci pip lci I L I c vci \s z vv i\^ 


r~hrnni/~ rpcniratnrx/ nicpacp 
v_lliuim_ icpuiidLUiy uipcdpc 


wss s/he 3 known smoker 


sny enlsrged/swo en glsnds 


Acute csrdisc 


an\/ nhvini ic, rprpnt iniiirv 
diiy uuviuuj i cue i i l iiijuiy 


an\/ farial t;\A/pllinn 
ai ly lauiai pvvciiii ly 


(^hrnnir rarrliar 

v„IIIUIIIU LQI UlaL 


wac c/hp known tn Hrink alrnhnl 

V V d P P/ IIC M lUvvl 1 LU U 1 1 1 1 1\ d 1 UU 1 1 U 1 


\A/a^ thprp a rnma "> 74 hr^ 
vvap liicic a uuiiia s zj^ i 1 1 p 




an\/ ci mnp^tinn of hnmiriHp 
diiy puyycPLiuii ui i i u 1 1 1 1 ui u c 


an\/ \A/pinht 

ai ly vvciyi il iupp 


Hia hptpc 

L> 1 d UC LCP 


an\/ rrin\/i i c innc nr rite: 
d 1 ly UUI IVUIjIUI 13 UI 1 1 LP 


an\/ anapmia/na pnpec 
ai ly ai idci 1 lid/ udici icpp 


MainnanrA/ 
iv idi ly i iai icy 


sny disgnosis of epilepsy 


sny drowsiness 


Liver disesse 


wss the fontsne e rsised 


sny delsyed/regressed development 


Kidney disesse 


iA/^c thp rnnt^np p r~\r p\/pna ci mkpn 
VVdp LI Ic 1 Ul 1 Ld 1 lci IC UI cycUd II pU 1 1 KCI 1 


anv/ Hiannr^cic ^t actn nn a 
ally UldyilUpli Ul dpLlllild 


LJIpUIUCIp Ul LI Ic UiyCpLIVC pypLclll 


sny hesdsche 


sny disgnosis of disbetes 


Disesses of the nervous system 


wss there psrslysis on both sides 


sny disgnosis of hesrt disesse 


Sickle cell snemis 


anv nara vc ic Aa/p3 knpec nn 1 ciHp 
diiy udiaiyp|p/vvcdr\iicpp u i i i p i u c 


anv Hiannn^i^; nf HIV/A I 
ai ly uiayiiupip ui i \\v/r\\uj> 


Tra nc nnrt-rplatpH arrinpnt 
i i ai ipuui l iciaLcu auuiuciiL 


an\/ ctiff nprk 
d i ly p li 1 1 i icuis. 


anv Hiannncic j-it n\/nprtpncinn 
ai ly uiayi iupip ui i lyuci lci ipiui i 


A f~r"iHpnta 1 nnienninn 

AAUUIUCI 1 LCI 1 UUIpUI III iy 


sny orsl csndidissis 


been dischsrged from hospitsl very ill 


Accidentsl drowning 


a n\/ riniH itv/lnrkia w 
d i ly i ly iu i Ly/ iuui\jd vv 


anv cii innpctinn nf ci iir~inp 
ai ly puyycpLiui i ui puiuiuc 


nthpf arrinpnt 

WLI ICI dUUIUCI IL 


annnrma nair rn nrinn 
d ui iui 1 1 id I iidii UU1UI 11 ly 


an\/ ci irnpn/ ii ict npfnrp Hpath 
ai ly puiyciy jupl uciuic ucoli i 


1— Inm irinp 

1 IUI 1 1 IUIUC 


an\/ rni inhinn witn n jni*~iH 
di ly Luuyi in ly vv i li i uiuuu 


anv niannncic r\f TR 
ai ly uiayi iupip ui i d 


Suicide 


an\/ rhpct main 
d i ly ui icj l [Jd 1 1 i 


\/\;ac c/hp anpni latplv \/arrinatPn 
vvap p/ 1 ic aucLiuaLciy vauun iolcu 




\A/ac thprp a rni inh tor S 3 wkc 
vvdp li icic d luuui i iui s j vvi\p 


an\/ niannncic r\f M\/pr Hicpacp 
ai ly uiayi iupip ui iivci uipcdpc 


AHrlitinnal inHiratni'c; 

nUUI LIVJI let 1 II ILIILa LU 1 o 


\A/ac thprp a mi inn fnr i in tn wkc 

VVdP LI ICIC d LUUUI 1 IUI UU LU -J VV1\P 


an\/ niannncic nt rsnrpr 
ai ly uiayi iupip ui uai iuci 


HiH hahu h3\/fi arrhpH harlf aftpr 0 Ha\/Q 
uiu uauy iiavc aiLi icu ua\.i\ aiLCi uayj 


3ny productive cough 


sny disgnosis of stroke 


b^by stopped sucking sfter d^y 3 


sny rspid bresthing 


sny disgnosis of mess es 


did the baby die on day 1 


3ny bresthlessness on exertion 


sny disgnosis of kidney disesse 


f\ i r\ trio m nthar T ^ i 1 ^ r\ a \\r a fa-f a n i i c + r^v/^ i r\ i/^rrlno 
UlU iMc moincl Tail T.O icCclvc IcTailUb TOXOIO VaCCiric 


an\/ hrpatn Ipcc: npec k/inn flat 
d i ly ui trd li iicjji icpp ly i i ly i id l 


anv niannncic /~\t nPinnn nhinnnath\/ 
ai ly uiayi iupip ui i ici i luyiuun luudLi ly 


HiH ffinui ilcinnc h^nnpn nn Hav 1 

UIU K.KJI I V U IPIUI I 5 1 la|J[JCI 1 UI 1 Uay 1 


an\/ rhpet inHra\A/inm 
diiy uiicpl iiiuidvviiiy 


anv niannncic nf ma aria 
ai ly uiayiiupip ui iiididiid 


iA/ac thpfp nn rru/mnup/hrpath at hirth 
vvap liicic i iu *-i yi 1 1 iu v c/ ui cci li i ai uiilii 


any difficulty breathing 


any delivery complications 


was baby's skin puffy/mushy at birth 


any breast lump or lesion 


any heavy bleeding around delivery 


did the baby fail to cry at birth 


any wheezing 


was there prolonged labor > 24 hrs 




any cyanosis 


were there convulsions during delivery 




any abdominal mass 


was the baby born early < 34 wks 


Additional causes of death 


any abdominal pain 


was the baby small < 2500 g 


Fresh stillbirth 


any diarrhea with blood 


was there difficulty breathing at birth 


Macerated stillbirth 


any vomiting with blood 


any congenital malformations 




any acute diarrhea (< 2 wks) 


was this a multiple birth 




any persistent diarrhea (2-4 wks) 


any umbilical infection 





Indicators added following refinement of InterVA for neonatal deaths are highlighted in bold. 
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Table 2 Characteristics of the three studies used as data sources 





Malawi 


Nepal 


Zimbabwe 


Neonatal mortality 
rate 


27/100000], 


33/1000131] 


24/1000[32] 


Study period 


1 year 


3 years 


2 years 


Number of VA 
questionnaires 


169 


385 


180 


Questionnaire 


Mixed open and closed questions 


Mixed open and closed 
questions 


Standard WHO tool incorporating open and closed 
questions (24) 


Interviewers 


5 lay Malawian interviewers with 
secondary education 


Lay local field 
coordinators 


45 midwife enumerators 


Physician review 


2 experienced local pediatricians 
Predefined algorithm 

3 causes of death 


3 experienced local 
pediatricians 
Predefined algorithm 
1 cause of death 


3 experienced local physicians 

International Classification of Diseases and Related 

Health Problems (ICD-10) 

1 cause of death 



Refining InterVA 

Initial refinement of the InterVA model was based on 
100 (59%) physician-reviewed VAs from Malawi. The 
use of these data for refinement was pragmatic in that, 
at the time of refinement, they were the only data avail- 
able. Data from the VA questionnaire were entered in 
the InterVA model, which assigned causes of death and 
associated likelihoods. The open histories, where the 
caregiver reported the events leading to death, were 
coded and also entered in the model. CSMFs obtained 
using the original InterVA and physician review were 
compared. CSMFs were calculated from the InterVA 
output as the sum of the likelihoods computed for each 
single cause of death category, divided by the sum of 
the likelihoods for all causes. For the calculation of 
CSMFs from physician-review data, if more than one 
cause of death was assigned, each was considered as a 
proportion of the total death. Therefore, if a single 
cause of death was assigned by all physicians, or if only 
one was available, it explained 100% of that death. If 
more than one cause of death was attributed, each con- 
tributed an equal proportion of the total 100%. For 
example, if both reviewing physicians assigned prema- 
turity as a cause of death and one of them also assigned 
sepsis, then prematurity contributed 75% and sepsis 25% 
to the death. In this way, every available physician diag- 
nosis contributed to the cause-specific mortality profile, 
avoiding a potential loss of information and bias that 
might have been introduced by using consensus diag- 
noses alone. 

Fifty-four neonatal-death questionnaires were analyzed 
with the original InterVA model. Stillbirths were initially 
excluded, as InterVA was not designed to classify them. 
The results of this first analysis identified the need for 
greater differentiation in the model among causes of 
death in the neonatal period. The InterVA indicators 
and matrix probabilities were therefore reviewed for 
clinical and epidemiological coherence by a pediatrician- 



researcher (SV) and an epidemiologist involved in the 
development of InterVA (EF). Following this initial 
refinement, InterVA was evaluated by comparing case- 
by-case diagnoses with physician-assigned diagnoses for 
the same 100 VA cases, as well the population-level 
CSMFs. A process of refinement and comparisons with 
physician review was undertaken until InterVA elicited 
mortality profiles deemed by the researchers to be plau- 
sible and satisfactorily comparable to physician review. 

Evaluating the refined InterVA model 

The modified InterVA model was evaluated by compar- 
ing population-level CSMFs derived from the two meth- 
ods of interpretation (physician review and InterVA) for 
a further and hitherto-untouched 69 VAs from Malawi, 
385 from Nepal, and 180 from Zimbabwe. A diversity of 
data sources was chosen to assess the performance of 
InterVA in a range of settings. Comparisons of popula- 
tion-level CSMFs were considered paramount as 
InterVA is intended as a public health tool for health 
monitoring and program evaluation, rather than for use 
in clinical settings. Nevertheless, individual level, case- 
by-case comparisons between physician diagnoses and 
InterVA were also conducted and the kappa statistic for 
interrater agreement was calculated to further evaluate 
the InterVA against the only available alternative 
method in our populations [23]. 

Ethical considerations 

The Maimwana study (Malawi) received ethical approval 
from the Malawi National Health Sciences Research 
Committee; the MIRA Makwanpur, Nepal, study was 
approved by the Nepal Health Research Council and the 
Institute of Child Health and Great Ormond Street Hos- 
pital ethics committees; and the Zimbabwe Maternal 
and Perinatal Mortality Study received ethical approval 
from the Medical Research Council of Zimbabwe 
(MRCZ/ A/1368). 
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Results 

Refining InterVA 

InterVA was modified to include two extra cause of 
death categories: fresh stillbirth and macerated stillbirth. 
To define the stillbirth diagnoses and differentiate 
among possible causes of stillbirth and neonatal death, 
nine further indicators were added to the model. The 
resulting modifications to the specific indicators and 
cause of death categories included in InterVA are 
shown in Table 1. As these are extra entities in the 
model, they run in parallel to the existing indicators and 
causes without directly affecting them. 

To compare the InterVA output and physician diagnoses 
in the three settings, some rationalization between the phy- 
sician-assigned causes and the causes obtained from 
InterVA was necessary; therefore, causes of death not 
included in the InterVA classification were grouped as 
"other." Similarly, infectious causes of neonatal deaths, 
including sepsis, pneumonia, and meningitis were grouped 
together into an "infection" category, since the possibilities 
of clinically distinguishing them in newborn infants is diffi- 
cult. There were no cases of neonatal tetanus. The resulting 
CSMFs for InterVA and physician review of the 100 VA 
cases from Malawi used to refine the model are shown in 
Figure 1. In 73% of cases, at least one of the InterVA diag- 
noses agreed with at least one of the physician diagnoses 
(kappa 0.60 (95% confidence interval [CI]: 0.57, 0.70)). 

Evaluation of the Refined InterVA Model 

After refining the model, case-by-case agreement 
between InterVA and reviewing physician diagnoses, for 
69 cases from Malawi, 180 cases from Zimbabwe, and 
385 cases from Nepal, was 83% (kappa 0.76 (0.75 - 
0.80)), 71% (kappa 0.41(0.32-0.51)), and 74% (kappa 0.63 
(0.60-0.63)), respectively. 

CSMFs derived from InterVA and physician review in 
Malawi, Zimbabwe, and Nepal are illustrated in Table 3. 
In Malawi and Zimbabwe, the rank order of causes of 
death was identical when derived from InterVA or physi- 
cian review. In Nepal, the most common cause of death 
according to InterVA was perinatal asphyxia, while it was 
neonatal infections according to physicians. Prematurity 
was diagnosed more commonly by InterVA than by phy- 
sicians in Nepal and Zimbabwe. InterVA detected a 
higher proportion of neonatal infections than physicians 
in Zimbabwe, but a lower proportion in Nepal. 

Stillbirths 

The proportion of total stillbirths identified by the two 
methods of VA interpretation was similar in all three 
settings. Data from Malawi and Nepal allowed for a 
more detailed comparison of the relative proportions of 
fresh and macerated stillbirths (Table 4). 



Multicountry mortality comparison 

Considering the above evaluations and taking the 
refined model to be adequate for the purposes of char- 
acterizing cause compositions of neonatal mortality for 
population health planning and monitoring, a three- 
country comparison of neonatal cause-specific mortality 
was conducted (Figure 2). It showed some differences in 
cause compositions of neonatal deaths, particularly in 
Zimbabwe compared to the other two settings. In Zim- 
babwe, the proportions of preterm births and deaths 
due to infection were higher (44%) than in Malawi 
(28%) or Nepal (20%). 

Discussion 

The deadline for the Millennium Development Goals 
(MDGs) is less than five years away and the need to 
quantify childhood mortality, understand its causes, and 
assess the effects of proposed interventions are central 
to MDG4. Neonatal deaths contribute about 40% of 
under-5 mortality globally [24]. A recent evaluation of 
the INDEPTH network of Health and Demographic Sur- 
veillance Sites [25] calls for all sites to use InterVA for 
coding of causes of death, since such approaches repre- 
sent "the only viable strategy to produce timely and 
comparable cause of death statistics" [26]. Our study has 
revised the InterVA method for verbal autopsy to 
improve its ability to identify causes of stillbirth and 
newborn death and tested it in three populations. 

In this study, physician review was used as a reference 
standard to compare InterVA. The use of physician 
review was the only alternative source of cause of death 
assessment for our study populations. This choice has 
limitations, however. Physicians are influenced by their 
experience, perception, and interpretation of local epide- 
miology [23,27]. Moreover, they mostly use the open 
history to reach a decision and may not account consis- 
tently for all the indicators. Sensitivity and specificity of 
physician review compared with hospital diagnosis in 
neonatal populations varied between 64% and 74% in a 
recent study [20] and concerns about inter- and intrara- 
ter reliability are well described [23]. 

An alternative to physician diagnoses is the use of 
hospital records. Hospital diagnoses have been used to 
establish sensitive, specific, and positive predictive values 
of VA diagnoses [8,12,20]. The main pitfall of hospital 
diagnoses in developing countries, particularly in rural 
settings, is that the CSMF of deaths occurring in hospi- 
tals are likely to be different from the ones in commu- 
nities [23]. There is therefore the risk of increasing 
precision of an interpretative method, defined as its abil- 
ity to reproduce hospital diagnoses in the population 
where it is tested. This would not necessarily produce 
results that are correct when used in populations where 
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Figure 1 Cause-specific mortality fractions from InterVA and 
physician review (PR) for the 100 VA cases from Malawi used 
to develop and refine the model. Note to Figure 1; Other causes 
include "jaundice," "multiple pregnancies," "maternal causes," 
"hypothermia," and "hypoglycemia." 



access to hospitals and health care is limited. Moreover, 
the ability to recognize, recall, and report signs of ill- 
nesses may be different among hospital users and non- 
hospital users. 

The results of InterVA as compared with physician 
review showed an almost identical ranking of causes of 
death. However, differences exist. Some of these differ- 
ences can be explained by the way the model was con- 
structed. Prematurity, for example, was over-diagnosed 
by InterVA in Zimbabwe and Nepal. This probably 
resulted from using a dataset where clinicians were 
allowed more than a single cause of death to refine 
InterVA. In fact, when multiple causes of death are 
allowed, prematurity is more likely to be listed as a 
coexisting cause of death than when a single cause is 
selected [28]. The model did not include "other" as a 
cause of death and would have classified such causes of 
death in one of the available diagnoses. 

InterVA over-diagnosed neonatal infections compared 
with physician review in Zimbabwe, while the opposite 
happened in Nepal. This inconsistency could be due to 
the interpretation of signs by different physicians. 



Table 4 Fresh/macerated split of stillbirths from Malawi 
and Nepal based on interpretation by InterVA and 
physician review 





Malawi 
169 VA 




Nepal 
385 VA 






Physician 
review 


InterVA 


Physician 
review 


InterVA 


Fresh 


23.2 


33.4 


24.7 


39.4 


Macerated 


4.8 


10.9 


19.2 


5.9 


Total Stillbirth 


28.0 


44.3 


44.0 


45.2 



Alternatively, it could be due to the selection of a priori 
probabilities. Greater understanding of the way physi- 
cians decide to value or ignore signs and symptoms may 
help in future refinements and evaluations of InterVA. 

Stillbirths were included for practical and public 
health reasons. Although globally there are about 3.2 
million stillbirths per year, reliable statistics are lacking 
[29]. This information gap has to be addressed. About 
half of perinatal deaths are accounted for by stillbirths 
[29]. The refinements including stillbirths in the model 
eliminate the need to differentiate between live births 
and stillbirths before processing VA data, making the 
method more suitable for use in large surveys. The 
separation between fresh and macerated stillbirths is 
relevant, as prevention strategies are different. The com- 
parisons between InterVA and physician review in 
Malawi and Nepal suggest that InterVA can differentiate 
the two categories, although, as with neonatal deaths, 
there may be room for further refinement. 

Case-by-case agreement was moderate in all datasets, 
however it was lower for Zimbabwe compared to Nepal 
and Malawi. The new indicators and matrix probabilities 
have been chosen and modified on the basis of the per- 
sonal experience of the researchers, and subsequently 
tested and modeled on a subset of the Malawi data. 
There is a risk, therefore, that the tool may be too clo- 
sely modeled on a sub-Saharan African setting (although 
the results from Nepal do not support this) or on a 



Table 3 Comparison of cause-specific mortality fractions according to InterVA and physician review 





Malawi 
69 VA 




Nepal 
385 VA 




Zimbabwe 
180 VA 






Physician 
review 


InterVA 


Physician 
review 


InterVA 


Physician 
review 


InterVA 


Stillbirth 


28.0 


44.3 


44.0 


45.2 


16.5 


20.1 


Perinatal asphyxia 


18.8 


19.4 


21.5 


26.4 


11.3 


9.9 


Neonatal infections 


23.3 


26.0 


28.0 


20.4 


30.6 


44.5 


Prematurity 


10.4 


7.5 


3.1 


6.5 


18.2 


23.9 


Congenital malformations 


2.2 


1.3 


0.8 


0.9 


1.3 


0.4 


Other 


13.3 




1.6 




9.5 




Indeterminate 


4.0 


1.5 


1.0 


0.6 


12.8 


1.3 
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Perinatal asphyxia 
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26.37 


Stillbirth 


20.06 


46.70 


45.23 


Congenital malformation 


0.36 


1.02 


0.91 
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Figure 2 Neonatal death cause compositions from InterVA interpretation of VA data from 169 deaths in 
Zimbabwe, and 385 deaths in Nepal 


0.56 

Malawi, 180 deaths in 



particular research setup. In addition, the modifications 
have so far not been put to a panel of experts and may 
need to be subject to a wider consensus. 

There may be important epidemiological and social 
explanations for the difference in the CSMF in Malawi, 
Zimbabwe, and Nepal. However, even if the interpreta- 
tion of verbal autopsy data by InterVA was consistent, 
methodological variability in other aspects of VA may 
have contributed to the observed cause distribution. 
Indeed, the close comparability of CSMF between 
Malawi and Nepal may to some degree reflect common 
data capture processes that differ from those used in 
Zimbabwe. It is possible that in Nepal and Malawi, the 
populations were part of research areas and might have 
been sensitized to recognize, describe, and recall signs 
of neonatal diseases, while in Zimbabwe the community 
was part of a government surveillance and may have 
responded differently. Nevertheless, this is a reality of all 
VA studies conducted in research settings. Use of lay (in 
Malawi and Nepal) versus health-professional (in Zim- 
babwe) interviewers and their gender may also have had 
an impact on data capture. This highlights the need for 



further methodological research into the effects of other 
aspects of VA. It is likely that a number of strategies 
and international collaborations will be necessary to 
ensure the success of such investigations. 

The modified version of InterVA for stillbirths and 
neonatal deaths produced plausible results when com- 
pared with physicians' opinions but had the advantage 
of being completely internally consistent, allowing stan- 
dardized comparisons of data from different countries. 
Ultimately, standardized methods are essential and their 
application and evaluation in a wide range of settings is 
encouraged. Through wider application, the strengths 
and weakness of InterVA, and VA in general, will 
become more apparent, thereby better informing the 
application and public health utility of surrogate meth- 
ods for measuring mortality in absence of vital registra- 
tion systems. 
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